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Abstract 

The most important families of non-linear codes are systematic. A brute-force check 
is the only known method to compute their weight distribution and distance distri- 
bution. On the other hand, it outputs also all closest word pairs in the code. In the 
black-box complexity model, the check is optimal among closest-pair algorithms. In 
this paper we provide a Grobner basis technique to compute the weight/distance 
distribution of any systematic non-linear code. Also our technique outputs all closest 
pairs. Unlike the check, our method can be extended to work on code families. 

Keywords: Grobner basis, distance distribution, Hamming distance, non-linear 
code. 



1 Introduction 

In the celebrated paper [Sha48] by Shannon, the mathematical founda- 
tion of coding theory was laid. Codes presented in that paper are non-linear, 
notably including codes used in the proof of the landmark Capacity theorem. 
Although no proof for linear codes of the Capacity theorem was known until 
the 60's ([Gal63]), coding theorists have been studying only linear codes, with 
a few exceptions ([Pre68],[BvLW83]). This is not surprising, since linear codes 
have a nice structure, easy to study and leading to efficient implementations. 
Still, it is well-known that some non-linear codes have a higher distance (or a 
better distance distribution) that any linear code with the same parameters 
([Pre68], [PHB98]). This translates into a superior decoding performance. 
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A class of non-linear codes that has received some attention is composed 
of systematic non-linear codes, since they are easier to encode. Moreover, 
the best known non-linear codes are systematic or equivalent to systematic 
ones ([BvLW83],[PHB98], [HKC + 94]), so that no performance degradation is 
shown while restricting to systematic codes. In this paper our main result is a 
method, based on Grobner basis computations, that allows to determine the 
(distance and) distance distribution of a systematic code. No other method 
is known, except for the "brute-force" approach consisting of checking the 
mutual distance of any pair of codewords. Both our method and the brute-force 
approach output the closest pairs, i.e. the codeword pairs whose distance is 
minimal. We show that the complexity of the "brute-force" approach matches 
the complexity of the closest pair problem and so this method is optimal (the 
proof is given within the black-box complexity model with distance oracle). 

2 Notation and preliminary results 

Let m > 1 be a natural number. Let IK be a field, IK be the algebraic 
closure of IK and / be an ideal in the polynomial ring K[Y] = K[yi, . . . ,y m ]. 
For any q > 1, we denote by E q [Y] C K[Y] the set of polynomials E g [Y] = 
{y\ ~ V\i ■ ■ ■ iVm~ Um} ■ Given a polynomial / G we denote by V(/) the 

set of all zeros of / in (K) m . Given an ideal / C K[Y], we denote by V(I) 
the set of all zeros of / in (K) m . Let S C (K) m . The set of all polynomials 
/ G K[Y] such that f(ai, . . . , a m ) = for any point (a±, . . . , a m ) in S forms an 
ideal in polynomial ring K[Y], called the vanishing ideal of S and denoted 
by 3(S). If L C K[Y], we denote by (L) the ideal in K[Y] generated by L. 

Let ¥ q be the finite field with q elements and (¥ q ) m be the natural Tri- 
dimensional vector space over ¥ q . 

Definition 2.1. Let 1 < t < m. We denote by M. mj t, q the following set: 
M. mM = {yh! ■■■Vh t | 1 < hi < . . . < ht < m}. 

From now on, we will use M m ^ instead of M m>ti9 . We will also shorten 
Uhi ■■■Vht to Y L , where L = {hi, h t }. 

Let s be an integer 1 < s < m — 1. We fix in ¥ q [yi, . . . , y s , t\, . . . , t m - s ] = 
¥ q [Y,T], the lexicographic order y± < y 2 < . . . < y s < h < . . . < t m - s . Let 
/ be an ideal in FJY, T] we denote by G(I) C ¥ q [Y, T] the minimal reduced 
Grobner basis of / w.r.t. < ordering ([Buc65,Buc06,CLO92]). 

Let : {¥ q ) k — > (¥ q ) n be an inject ive function and let C be lm(0). We say 
that C is an (n, k, q) code. Any c G C is called a word. Let rr : (¥ q ) n — > (¥ q ) k 
be 7r(ai, . . . , a n ) = (ai, . . . , a^). We say that C is systematic if (7ro0)(i>) = v 
for any v G (¥ q ) k . We denote by C(n, k, q) the class of systematic (n, k, q) codes. 

For any two vectors Vi,v 2 G (¥ q ) n , d(vi,v 2 ) denotes the (Hamming) 
distance between vi and v 2 - For any v G {¥ q ) n , w(t> ) denotes the weight of 
v. Let C G C(n,k,q), d(C) denotes the distance of C, Bi = Bi(C) denotes 
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the number of codewords in C with weight i. Integer set {B , B ± , . . . , B n } is 
called the weight distribution of C and A { = A^C) denotes the number of 
(unordered) codeword pairs with distance i. Integer set {Ai, . . . , A n } is called 
the distance distribution of C . 

A code C is distance-invariant if for any 1 < i < n and any c, d G C, 

\{yeC\d(c,y) = t}\ = \{yeC\d(c',y)=i}\. 

Clearly linear codes are distance-invariant. The distance distribution of distance- 
invariant codes (containing the zero vector) can be immediately obtained from 
their weight distribution. Note that many optimal codes are distance-invariant 
codes ([HKC+94]). 

We want to show some relations between some sets of vectors in (¥ q ) n 
having a weight property and the Grobner bases of associated ideals. We use 
polynomial ring ¥ q [Y] with any term-order, since our results here hold for any 
(admissible) term-order. 

The elementary symmetric functions are the polynomials o~\ = yi + ■ ■ ■ + 
y m ,...,cr m = yxViVz • • • y m -2V m -\V m - 

Definition 2.2. Let t G N be s.t. 1 < t < m. We denote by I ni:t the ideal: 

I m>t = (K . . . , a m } U E q [Y\) C ¥ q [Y] . 

Our aim is to determine the reduced Grobner basis of I mtt . For this, we 
need a preliminary result. 

Lemma 2.3. Let I G N be such that 1 < I < m — 1. Then 

m„ m c (^,,+1 u {a,} us,[r]). 

Proof. Let / be the ideal I = (M m , i+ i U {a t } U E q [Y] ) . 
Let A = {A C {1, .., m} \ \A\ = 1} and L = {i u . . . , i{\ G A. We have: 

= E E ^ c^c 1 E E ^- (!) 

AM B£A AEA BeA 

{h}CA {h}£B {il}CA {il}£B 

We note that y^ Yl^y^B is a sum of monomials in M mj j+i, hence 

^r 2 (^i E^kb^s) = C'E^b^ is in J - % (!)» since 2/r 1<7 « e J ' we 

obtain that 

vV E ^ G J - 

(•i}ca 

From (1), since any monomial in E{u}cA^4 contains y^, by reduction w.r.t. 
-E 9 [Y] we have a^a Y a G J. We can write: 

{n}cA 

E = E y b+ E ^, (2) 

A(zj\ B£A CGA 

{n}CA {«1,«2}CB {«l,«2}SZCr.{«l}CC 
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and from (2) we get 

vVY, Y * = vV E y b+vV E y c- 

ACA BeA {i\,i-2}t- c 

{h}cA {i 1 ,i 2 }cB {n}cc,ceA 

Since y^YiVfW Y b = yV^iVvt Z){<i.*2K* *b) and ^ Z){*i.*aKfl *b is in J, 

{•l}CB {'l}CS {ii}CS 

then the former is in 7. 

Similarly, y\~ x Z^ i 2 } C B^B is in I, and by reduction w.r.t. E q [Y] we have 

E ^ e 7 - ( 3 ) 

BeA 

{>1,12}CB 



In the same way, restarting from (3), we will eventually deduce that T^dea Yd 

_ LCD 

is in I. Since \D\ = \L\, then J2lcd Yd = Yl = • • -y^, i.e. it is an element 
of M m ,j. But L is a generic element of A and M m ,z = {Y L } LeA . □ 

We now determine the reduced Grobner basis for J mjt . 

Theorem 2.4. Let t e ~N be such that 1 < t < m. Let G(I myt ) be the reduced 
Grobner basis of l m ,t- Then: 

G{I m , t ) = E q [Y] U M m , t , fort> 2, 

G(I m ,t) = {yi, • •, ,y m }, fort = l. 

Proof. The statement is easily proved by checking Buchberger's criterion, 
that is, that all S— polynomials coming from G(I m>t ) are reduced to zero by 
reduction via G(I m<t ). □ 

An obvious consequence of Theorem 2.4 is that M mj t C I m ,t- For any 
1 < i < m, we denote by P the set p = {c G (F q ) m | w(c) = i }, and by 
Qi the set Qi = ^o<j<iPj- Set p contains all vectors of weight i and set Qi 
contains all vectors of weight up to i. In the next theorem we describe the 
Grobner basis of the vanishing ideal of Qi. 

Theorem 2.5. Let t be an integer such that < t < m — 1. Then 

hence the reduced Grobner basis G of 0(Qt) is 

G = E q \Y]\JM m , u fort>l, 
G = {yi,...,y m }, f O rt = 0. 

Proof. It is enough to show both inclusions 

({<j t+1 , ...,a m }UE q [Y])C 3(Q t ), V(({a t+1 , ...,a m }U E q [Y])) C V(3(Q t )) . 
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For any c in Q t , we have c G (¥ q ) m and a t+1 (c) = 0, . . . , <J m (c) = 0, hence 
(br m ,...,a m }U C 3{Q t ). 

Since V(J((5t)) = Qt, we need to show 

aeV(({a t+u ...,a m }UE q [Y])) a E Q t - 

We use the relation (F g ) m = P U Pi U . . . U P m and we observe that 
V(({cr t+ i, . . . , <r m } U Pg[F])) 7^ 0, since it contains the zero vector. 
Suppose by contradiction that there is an a G Pt+\ U Pi+2 U . . . U P m such that 
cr t+ i(a) = 0, . . . , <J m (a) = 0. Since a has weight r, with r > t + 1, it follows 
that there exists a monomial m G M mir such that m(a) 7^ and m(a) =0 for 
any other monomial m in lil m , T - Hence o>(a) 7^ 0, contradicting the hypothesis 
aeV({{a t+1 ,...,a m }UE q [Y})). □ 

3 A distance-computing algorithm 

In this section we propose a computational method to find the weight 
distribution of a code C in C(n, k, q). We also extend this method to find the 
distance and the distance distribution of C. From now on, t will be understood 
to satisfy 1 < t < n and C will denote a code in C(n, k, q). 

3.1 Grobner basis of a non-linear systematic code 

We apply our previous results to give a structure for the Grobner basis of 
C. Let n,keN and ¥ q [X, Z] be as in Section 2. We also use E q [X] C ¥ q [X, Z] 
with the obvious meaning. 

We can view C as a set of points in (F 9 ) n C (W q ) n and hence as a 0- 
dimensional variety, so that J(C) is its vanishing ideal in FjX, Z\. We describe 
the reduced Grobner basis of J(C) w.r.t. lex. 

Theorem 3.1. Lei (7 be the reduced Grobner basis for I = J(C) u;.r.t lex 
with xi < • • • < Xk < Z\ < • • • < z n -k- Then: 

G — E q [X] U {zi — fi, . . . , z n _ fc — f„-fc} 

for some fj G F 9 [X], 1 < j < n — k. 

Proof. Since C C (F g ) n and C is systematic, PJX] C G. The existence of 
polynomials Zi — fj follows from the fact that any non-systematic component 
depends only on the X block of variables. □ 

Example 3.2. Let C be the following (4,2,2) code: 



C = {(0,0, 0,1), (0,1, 0,1), (1,0, 0,1), (1,1, 0,0)} 
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The reduced Grobner basis of 0(C) C F 2 [xi, x 2 , Zi, z 2 \ w.r.t. the lex ordering 
with xi < x 2 < Z\ < z 2 is G(C) = {x\ + xi, x\ + x 2 , z±,z 2 + X\X 2 + 1} . So 
f i = and f 2 = x\x 2 + 1. 

Any set of polynomials endowed with the structure of Theorem 3.1 is 
a Grobner basis for the ideal generated by itself, and such ideal is zero- 
dimensional. Hence, the corresponding variety is finite and satisfies the prop- 
erties of the codes in G(n, k, q). In this sense we can identify polynomial sets 
(with the structure of the theorem) and codes in C(n, k, q). We make it explicit 
in the following theorem. 

Theorem 3.3. Let A ktJl be the set 

A Kn = {(h, . . . , f n _ fc ) | fj : {¥ q ) k — > F„ 1 < j < n - k}. 

There is a bijection A^ n <-> C(n, k, q) given by 

(h, ■ ■ ■ , fn-fc) i > G = E q [X] U {Zi — fi, . . . , Z n _fc — f„_fc} • 

3.2 Weight distribution for non-linear systematic codes 

The first computational method we propose is a method to obtain the 
weight distribution for C . 

Definition 3.4. Let G(C) be the reduced Grobner basis for C , G(C) = E q [X]U 
\z\ — fi(X), . . . , z n -k — fn-k(X)}. We denote by Wq the following ideal in 
Fjxi, . . . ,x k ]: 

W' c = (E q [X\ U{m( Xl ,...,x k ,h(X),..., f n _ fc (X)) | m e M n , t } . 

Lemma 3.5. V(W C ) ^ 3c e C s.t. w(c) <t-l. 

Proof. Let a G V(W^), a = (a±, . . . , a k ). We have that c is in C, where 
c = (ai, . . . , a*;, fi(a), . . . , f n - k (a)). Since a e V(W^), we have 

m(a) = m(ai, . . . ,a k ,h(a), ■ ■ ■ ,f„_ fc (a)) =0 Vm G M„, t . 

Which means c G V(/ mjt ), hence c G Qt-i (Theorem 2.5), i.e. w(c) < t — 1. 
The converse can be easily proved by reversing our previous argument. □ 

Lemma 3.5 shows that a point in V(Wq) matches a codeword c in C with 
w(c) < t — 1, from which we can easily derive the main result of this subsection. 

Theorem 3.6. £ t _i = |V(W* C )| \ |V(W* C x )| . 

If C is a distance-invariant code, from the weight distribution of C we can 
immediately get its distance distribution. 
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3.3 Distance and distance distribution of non- linear systematic codes 

We now propose a computational method to find the distance of C. 

Definition 3.7. Let f 1; . . . , f n _ fc be as in Theorem 3.1. We denote by ¥ q [X, X] 
the polynomial ring ¥ q [xi,x 2 , ■ ■ ■ , x k , Xi, x 2 , . . . , x k ]. I n the polynomial module 

(w q [X, X]j , we denote by L n>k;t the polynomial vector: 

L n ,k,t = (xi ~ xi, . . . , x k - Xk, h(X) - h(X), f n -k(X) - fn-k(X) j . 

Definition 3.8. Let fi, . . . ,f n -k be as in Theorem 3.1. We denote by T c the 
ideal in W q \X,X\ generated by : 

{xi - Xi, £i q - Xi I 1 < i < k} U {m(L n>k>t ) \ m e M rM } . 

In (F q ) k x (F q ) k we denote by A k the diagonal, i.e. the set of points a = 
(ai, . . . , a k , di, . . . , d k ) such that a, = ctj, 1 < i < k. 

We need to take into account the diagonal because, clearly, A k C V(J^) 
(note that A k = V(J^)). 

Theorem 3.9. V(T C ) ^ A k 3 c u c 2 E C such that d(ci, c 2 ) < t - 1 . 

Proo/. Let (a, a) G (F (? ) fc x (W q ) k \A k . We take two codewords c, c such that: 

c = (ai, ...,a fc ,fi(a), . . . ,f„_ fe (a)), c = (fii, . . .,a k ,h(a), . . . ,f n _ fc (a)) . 

From this, we obtain that 

(a, a) e V(J* C ) ^ A fc ^ m(c - 5) = 0, c / c, in 6 ^ , 

since c - c = (a ± - Si,...,a fc - a k ,fi(a) - fi(a), f n _ fc (a) - f n _ fc (a)). But 
m(c — c) = Vm e M„ )t means c — c G V(/„ )t ), which is equivalent to w(c — c) < 
t — 1 (Lemma 3.5). □ 

We can easily derive the following result. 

Corollary 3.10. V(J' C ) = A k d(C) > t. 

From Corollary 3.10, an algorithm is directly designed to compute the 
distance of C. 

J = 2 

While V(y c ) = A k do 

j :=j + l] 
Output j — 1 
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Example 3.11. Let C = {[0, 0, 2, 0], [0, 1, 0, 0], [0, 2, 0, 2], [1, 0, 2, 2], [1, 1, 2, 1], 
[1,2, 1,2], [2, 0, 1, 0], [2, 1, 1, 1], [2,2,0,1]} be a (4,2,3) code. The Grdbner 
basis G(C) G F 3 [xi, x 2 , z±, z 2 ] w.r.t. lex x\ < x 2 < z\ < z 2 is 

G(C) = {x\ — Xi,x\ — X2,Zi—x\+x\x2—x\+X-L + \,Z2—x\+XiX2+X'i—x\—X-i}. 

So fi = x\ — x\x 2 + x\ — x\ — 1 and f 2 = x\ — — £ 2 + xf + x\ . We choose 
as monomial order in F3[xi, X2, Xi, x 2 ] the degrevlex order with x\ > x 2 > 
x\ > X2- Computing the Grdbner basis of "Jq, we have G{3 2 c ) = {x\ — x±, x 2 — 
X2, x\ — xi, x\ — X2, xl — Xi, x 2 — x 2 }. Since xi — xi, x 2 — x 2 G G, then V(J^) 
is the diagonal, so d > 2. Computing then G{^%), we find 

G("Jq) = {xl~X 2 , xf—Xl, x\~X 2 , xf—Xl, X\X\X 2 ~X\X 2 , X\X 2 X 2 -X 2 X\X 2 , XlX 2 Xl-X 2 Xi} . 

This time x\ — x\ G(0q) and the variety V(3^) / A3, which implies d = 2. 

Theorem 3.9 suggests a way to compute the distance distribution of C. 
Indeed, we showed that a point in V(3q) is a pair of codewords with distance 
less than t. From that, we have the following. 

Corollary 3.12. V(J' C ) = {( Cl , c 2 ) | c x ± c 2 G C, d(ci, c 2 ) < t - 1} U A fe . 

Let Ci, c 2 G C be such that d(ci, c 2 ) = i, for 1 < % < t — 1. In V(J^) there 
is both the point that matches the pair of words (ci,c 2 ) and the point that 
matches the pair (c 2 ,Ci). The following fact is then obvious. 

Fact 3.13. 



Example 3.14. Let C = {[0, 0, 0, 0], [0, 1, 0, 0], [0, 2, 0, 0], [1, 0, 0, 0], [1, 1, 0, 2], 
[1, 2, 0, 2], [2, 0, 0, 2], [2, 1, 0, 0], [2, 2, 0, 0]}. Clearly, C is in 6(4, 2, 3). The dis- 



tance distribution of C can be determined by hand: A\ = 8, A 2 = 20, A 3 = 
8, A 4 = 0. We want to compute all pairs of words (ci, c 2 ) with d(ci, c 2 ) < 2. 
To accomplish this, we start from the input basis of ideal 3^: 

/ O O OO O O OO o o 

= (x 2 xfx 2 — xfxfX2 — X\X\X2 — xjx 2 ~ X\X 2 X\X 2 + X 2 X\X 2 + X\X\X 2 ~ 

x\X2 — x\x\x 2 — x\x 2 + x\x 2 + x\x 2 x\ + x\x 2 x\ — x\x 2 — x\x 2 , xfx 2 xfx 2 + 
xix 2 xfx 2 — xfx 2 x 2 — X1X2X2 — x\x\x\ — X\x\x\ + x\x\ + x±x 2 , x 2 — x 2 , x 2 — 

X2, x\ — X±, xf — Xl, X 2 xf X 2 + X 2 XlX 2 — xfx 2 X 2 — XlX 2 X 2 — x\x\x2 — X\x\x2 ~ 

x\x2 — x\x\x,\x 2 + x\x\x 2 — x\x 2 + xix 2 x 2 — x\x 2 + x\x 2 + x\x 2 x\ + x\x 2 x\ — 

t V * 2 - 1 « • V • / V • 'I ' /-ya 2 . y . 2 '| ' 2 -| i 2 | / V • 2 ,'| j -V' 2 | | ' — | ' • V • 2 .'| ' /-ya 2 '| ' '| ' 2 . y . 2 | /-ya 2 ,ya 2 • V « V* — 

ju ^Ju 2. t t ]^ x 2 5 J.- 1 ^ 2 12 ' 2 1 2 ~ -X ^ -X' i x 2 juyju^ 1 2 2 ~ 12 ^1^2 

x 2 xf x 2 — x 2 xix 2 + xf x 2 x 2 + xix 2 x 2 — x\x\x\ — x\x\ — x\x\x,\ + x 2 xi + xf x 2 + 
x\x\,x\x 2 x\x\ + x 2 x\x\ — x\x 2 x\ — x\x 2 x\ — x\x\x 2 + xfx 2 — x\x\x\x 2 — 
x\x\x\x 2 — x\x,\x 2 + x\x 2 + x\x 2 x 2 + x±x 2 x 2 + x\x 2 x\ — x 2 x\ — x\x 2 x\ +X 2 X\) 



From the Grdbner basis we find that |V(J^)| = 65 ([BCRT93]). Since |A 2 | = 9, 
we have A x + A 2 = (65 — 9)/2 = 28, in accordance with known values. 




l<i<t-l 
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We can easily get an explicit formula for a generic A t , since = 

2£*r 1 ^ + iA fc |. 

Theorem 3.15. 

1 2 
We provide a last example. 

Example 3.16. For the same code of Example 3.14, we find A2. 

"Jq = (X1X2 — X\X2 — X2X\ +X\X2, x\ — X2, x| — X2, x\ — X\, X\x\ — X\x\, X\x\x2 ~ 

— ~2 2 ~ 2 2 2~ 2~ ~ 22 2 

X~iX2]%\ X<2%\ X\ ~\~ X\db<2 3>1 ~\~ X\^X\X*2*C\ X^X\ ~\~ X\X\ X^X<2 X-^ ~\~ 

X\,x\x2X\ + X\X2X\ - X\X2 ~ X\X 2 ) 

From the Grobner basis we obtain that |V(3^.)| = 25. Applying Theorem 3.15, 
we can thus conclude A 2 = (65 — 25) /2 = 20. 

4 Complexity considerations 

The complexity of our method (Corollary 3.10) is basically the complexity 
of the computation of a Grobner basis for 3^, 2 < t < d+ 1, which is the core 
of the algorithm. The ideal requiring a bigger computational effort is obviously 
"Jq with the largest t that we have to compute. From now on, we try to consider 
the ideal 3 l c where the f polynomials are as generic as possible. 

To compute the distance, we do not need the exact structure of the 
Grobner basis, but we just want to know if V(J^) is the diagonal. We ex- 
perimentally found that, if V(jy is the diagonal, then the basis obtained by 
auto-reductions is the diagonal as well. Thus, in our case the complexity of 
the Grobner computation seems to be equivalent^] to that of the computation 
of an inter- reduced basis from the proposed generator set for 3^. Hence, we 
focus on the complexity estimation for the relevant inter-reductions. 

We want to estimate the complexity to get an inter-reduced basis, starting 
from the input basis of generators for 3^. We will use some experimental re- 
sults. Let L be a set of polynomials that we want to inter- reduce. Let N = \L\. 
A single reduction involves an element / of L that is reduced w.r.t. L \ {/}. 
The output is either 0, in which case L becomes L \ {/} (and iV becomes 
iV — 1), or / itself, if it cannot be reduced, or a new polynomial /', which is 
the remainder and which replaces / in L. So, in the worst case, any reduction 
requires a division by iV — 1 polynomials and generates a new polynomial /'. 
Note that the L size cannot increase. The starting step requires then, in the 
worst case, N ( N ^~ 1 *) divisions. Let r be the number of new polynomials which 
are obtained by reductions, both at the starting step and at any other sub- 
sequent step. Since iV does not grow, any of these r new polynomials has to 



They give different results, except when they output the diagonal. 
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be divided by no more than N — 1 polynomials. In conclusion, the worst case 
estimate gives JV( -^~ 1 ^ + rN divisions, and 2 n n is clearly the (worst case) cost 
of any division. 

In our case, N — (?) = ( 2 fc fe ) . By induction, we can derive the following lemma: 

Lemma 4.1. Let n, k G N, with 1 < k < n, and a G Q. If s(a) is such that 
1 _|_ 2 a < 2 s{ - a \ then: 

We denote by N the number of polynomials in 

Sgen = {xl - Xi, £i q - £i \ 1 < % < k} U {m(L njk ,t) I m G M n , t } ■ 

We treat the hard case, i.e. when n = 2k and d ~ k. Despite our worst case 
formula, our computations suggest that it is possible to assume: 



iV initial divisions instead of 



N(N-l) 



2 ' 

• r = \/2 2k = 2 k (2 2k is the number of all possible monomials), 

• 2 k n instead of 2 n n, the average cost of each division, 

• the average number of divisions for any new polynomial /'. 

Thus, the computational cost C to produce an inter-reduced basis from S gen 
can be estimated in 



2 3fc + 2P2P 



2k 

2k = (2 3fc + 2 3fc )2fc = 2 3k+1 2k, 



where we applied Lemma 4.1 with a = 1 and s(a) = La 2 1 We would like 
to compare our estimates with our numerical results. In order to do that we 
highlight the asymptotic exponential behaviour in k. 

Definition 4.2. Let f, g : R i-> R s.t. f(m) > 0, g(m) > for m > 1. We 
say that f ~ g if and only if there exist mi, m.2, cxi, U2 > 1 snc/i i/iai 

f(m) < g(m)m ai m > rri\, 9{ m ) — f{m)m a2 m > m 2 . 

It is easy to see that ~ is an equivalence relation and, given / ~ 2 an and 
g ~ 2 f3n , for a, (5 > 1, a, (5 G R, then / ~ g if and only if a = /3. Moreover 
a > (3 implies limn^oo 4 = 0. 

With this notation, our previous estimates for C can be written as C ~ 2 3fc . 

We tested our problem using different computer algebra systems (Magma 



actually s(a) is slightly larger. 
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2.10.13 [MAG], Polybori 0.3.1 [pol], Singular 2.0.6, Singular 3.0.4 [GPS07]). 
For any system the time needed has a behavior of kind 2°'° , with a depending 
on the system. In particular, we report the graph where x = k and y = 
l°g( fi mTjn'fc-i )' so ^ na ^ ^ ne y va l ues represent the expected exponent. This 
suggests us the following values for the computational costs: 2 7k for Magma 
2.10.13, 2 4 5fc for Singular 2.0.6, 2 3 ' 5fc for Singular 3.0.4, 2 3fc for Polybori 0.3.1. 
We note that a brute-force check of the distance has an asymptotic behaviour 
like 2 2k . Unfortunately, it is impossible to find an algorithmic] that computes 
the closest pairs of a code with a lower complexity, since 2 2k is indeed the 
complexity of the problem (see Section 4.1). admittedly, 2 2h looks better than 
our estimate 2 3k , but two comments are in order: 

• If we examine the drastic improvement obtained by the evolution of com- 
puter algebra systems (from 2 7k to 2 3k in few years), we can reasonably 
assume that our estimates are still pessimistic and that further improve- 
ments in software development will allow our method to run like 2 ak , with 
2 < a < 3, possibly close to a = 2. 

• The brute-force check can output the distance only if the code is given. 
When a family of codes is given, that is, the {£} depend on some param- 
eters the check is inapplicable, but our approach may be able to 
give general results for the family, such as a lower bound on the distance 
independent of the A/s or even a "a priori" bound dependent on the A/s. 

4-1 Complexity of the closest-pair problems 

The determination of the minimum distance of a non-linear code is an in- 
stance of the more general "closest pair problem", which is studied for general 
metric spaces and, more deeply, for the Euclidean spaces M n . Several related 
problems are studied in this context, such as the "nearest neighbour problem" 

3 The situation in the linear case is totally different. The problem of finding the 
distance is NP-complete ([Var97]), so no sub-exponential algorithm is known, but 
it might exist. 
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(which is the "decoding problem" in coding theory). An excellent reference is 
[PS85], to which we implicitly refer when we do not give explicit definitions 
or quotations. 

From now on, let S be the number of points we are considering. In the 
breakthrough 1975 paper [SH75] the complexity of many of these problems 
was established for the planar Euclidean case (i.e., for M. 2 ). In particular, it 
was shown that it is possible to solve the closest-pair problem in Slog(S) 
steps, by a clever application of the Voronoi diagrams. And it was also shown 
that Slog(S') matches exactly the complexity of the problem. Since S 2 is 
the complexity of the "naive approach! 4 r . one might think that the naive 
approach could be beaten also in other metric spaces. Later, more refined 
algorithms have appeared that solve the closest-pair problem over IR n with 
a claimed Slog(S) complexity. However, these algorithms' complexity is ac- 
tually S\og(S)f(n) for some function / : N — > N, that is, their complexity 
is computed by considering the space dimension fixed. The fact here is that 
computational geometers are interested in cases when the number of points 
is much larger than the space dimension. In the Hamming space (F 2 ) n it is 
crucial to write explicitly also the dependence on n, since clearly S < 2 n . A 
deep analysis of their proofs shows that / is exponential in n (e.g., about 4 n in 
Suri's divide-and-conquer algorithm [Sur09]). Translated into Hamming space 
language, this means that the naive algorithm performs no worse than the 
others. In fact, even if we allow for / = 2 n , we will get an overall complexity 
of S\og(S)2 n = 2 k ■ k ■ 2 n ~ 2 3k (2 2k is the complexity of the naive approach). 
Of course, we would expect that the dependence on the dimension could be 
improved. 

Remark 4.3. The instance in (F 2 ) n of the closest-pair problem is actually 
different from the "distance computation problem" , since the latter needs only 
to output the value of the distance and not the closest pair. However, both our 
Grobner basis algorithm and the naive approach do output the closest pair/s, 
and so we will consider the former from now on. 

When not dealing with Euclidean spaces (or metric spaces embeddable 
with an isometry into R"), the standard model in the literature for these 
problems is the "black box with distance oracle" . Basically it means that the 
complexity is computed in terms of the number of distance calculations which 
are necessary (a "call to the distance oracle" means a distance calculation). 
As an example, it is possible to adapt the proof of Theorem 2.3 in [KL05] to 
prove the following 

Theorem 4.4. The (worst-case) complexity of decoding non-linear codes in 
(¥ 2 ) n isQ(S). 

Before we prove it, we need to explain the meaning of a "proof". We are 
assuming that there is an algorithm A, which accepts as input both a non- 



4 this is the way the "brute-force check" is called in computational geometry. 
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linear code X of size S and a query q, i.e. a point for which we must find 
the point/s in X closest to q. Algorithm A is the best possible algorithm. It 
calls the distance oracle many times and tries to use the information on the 
computed distances to avoid making other distance computations. What A 
can use is the triangle inequality: 

d(x,y) < d(x,z) +d(y,z), d(ar, z) > d(x, y) - d(y, z) (4) 

since the algorithm is searching a distance minimum, it will use the right-hand 
version of (4). To prove Theorem 4.4 we need to exhibit, for any n sufficiently 
large, a code of length n and a query q such that A is forced to perform Q(S) 
distance computations. 

Proof. Let £ = |_§ J - Let X n be the code in (F 2 ) ra for n > 4 with query q n 

X n = {ce (¥ 2 ) n | w(c) = £} U {P = (1, 0, . . . , 0)}, q n = {(0, . . . , 0)}. 

In other words, q n is the zero vector and X n is the sphere centered in q n of 
radius £ plus a point P at distance 1 from q n . The crux of the proof here is 
that A will always get £ from its distance computations, except for d(q n ,P), 
which will give 1. So, it does not matter how smart A is, it will not able to 
use any of its former distance computations. Therefore, in the worst case, A 
has to try all d(q n , x) for x G X. □ 

Note that the size of X n in the above proof grows with n (actually \X n \ > 2 n l 2 ). 
Without this property, we could take X n with two points and then of course 
any algorithm will need to perform 1 = S — 1 = Q(S) distance computations. 

The previous digression is important in our opinion to put into context 
the problem and understand the proof of our last result in this section, which 
is the following theorem. 

Theorem 4.5. The (worst-case) complexity of computing the closest codeword 
pairs of a binary non-linear code is Q(S 2 ). 

Proof. For any subset Y of (F 2 ) n we denote by D its diameter, that is, the 
maximum distance between two points in Y, and by d its (Hamming) distance. 
Let !B be any algorithm having as input a non-linear code and returning a pair 
of closest codewords (we can think of B as the best possible). To prove our 
claim we need to show that for any sufficiently large n we can find a code X n 
such that 23 cannot use any distance computations already performed in order 
to discard other distance computations. 

We consider n > 10. We take X n as any maximal subset of (F 2 ) n such 
that its aspect ratio is strictly lower than 2. The bound obtainable by "B 
from any of its former computations is at most 

d(x, z) > max d(x, y) — min d(y, z) = D — d < 2d — d = d . 

This cannot give any help to !B and so 23 is forced to compute also d(x, z). □ 
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We would like to give one final remark. The size of code X n in the previous 
proof clearly grows with n, but it will have only a few words. If one is inter- 
ested in building a larger code on which B needs to compute Q(S 2 ), then one 
will need to use Ramsey-like properties of the Hamming space ([BLMN05]), 
similarly to what is done in [KL05]. To say it in a few words avoiding tech- 
nicalities, it is possible to find a subset Y of (F 2 ) n with a size 2 Q (") and such 
that its aspect ratio is limited to 2 + e, for some small e. Since the number 
of minimum-distance word pairs and the number of maximum-distance word 
pairs may be "small" compared to the number of all word pairs, it follows 
that 23 may have to examine in the worst case a number Q(S 2 ) of word pairs 
before being able to use (4). However, we feel that a complete proof for this 
claim is out of the scope of this paper and so we do not delve into it. 



5 Conclusions 

The decoding performance of a distance- invariant code (e.g., a linear code) 
depends on its weight distribution, although already the distance gives partial 
information on it. For a generic non-linear code the performance depends on 
its distance distribution (but the distance provides significant information). 
In this paper we have provided some Grobner techniques in order to compute 
the above-mentioned code parameters in the systematic case (which is the 
most interesting). We realize that no method can be faster than running a 
specific C-programme optimized for a given code and we are far from claiming 
that our techniques can compete with this approach. However, the optimized 
software has two drawbacks: 

• the software writing and debugging can be long and complex, (while our 
methods are very easy to implement using a software package for Grobner 
basis computations), 

• the software programme can be used only on a given code and cannot 
give general results on a code family (while our methods could). 

In our opinion this means that our methods can be of interest for a mathe- 
matician investigating theoretical code properties. 
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